157 research outputs found

    Weighted-distance sliding windows and cooccurrence graphs for supporting entity-relationship discovery in unstructured text

    Get PDF
    The problem of Entity relation discovery in structured data, a well covered topic in literature, consists in searching within unstructured sources (typically, text) in order to find connections among entities. These can be a whole dictionary, or a specific collection of named items. In many cases machine learning and/or text mining techniques are used for this goal. These approaches might be unfeasible in computationally challenging problems, such as processing massive data streams. A faster approach consists in collecting the cooccurrences of any two words (entities) in order to create a graph of relations - a cooccurrence graph. Indeed each cooccurrence highlights some grade of semantic correlation between the words because it is more common to have related words close each other than having them in the opposite sides of the text. Some authors have used sliding windows for such problem: they count all the occurrences within a sliding windows running over the whole text. In this paper we generalise such technique, coming up to a Weighted-Distance Sliding Window, where each occurrence of two named items within the window is accounted with a weight depending on the distance between items: a closer distance implies a stronger evidence of a relationship. We develop an experiment in order to support this intuition, by applying this technique to a data set consisting in the text of the Bible, split into verses

    Oii-web: An interactive online programming contest training system

    Get PDF
    In this paper we report our experience, related to the online training for the Italian and International Olympiads in Informatics. We developed an interactive online system, based on CMS, the grading system used in several major programming contests including the International Olympiads in Informatics (IOI), and used it in three distinct context: training students for the Italian Olympiads in Informatics (OII), training teachers in order to be able to assist students for the OII, and training the Italian team for the IOI. The system, that is freely available, proved to be a game changer for the whole italian olympiads in informatics ecosystem: in one year, we almost doubled the participation to OII, from 13k to 21k secondary school students. The system is developed basing on the Contest Management System (CMS, http://cms- dev.github.io/), so it is highly available to extensions supporting, for instance, the pro- duction of feedback on problems solutions submitted by trainees. The system is also freely available, with the idea of allowing for support to alternative necessities and developmen

    Risk Assessment for Venous Thromboembolism in Chemotherapy-Treated Ambulatory Cancer Patients: A Machine Learning Approach

    Get PDF
    OBJECTIVE: To design a precision medicine approach aimed at exploiting significant patterns in data, in order to produce venous thromboembolism (VTE) risk predictors for cancer outpatients that might be of advantage over the currently recommended model (Khorana score). DESIGN: Multiple kernel learning (MKL) based on support vector machines and random optimization (RO) models were used to produce VTE risk predictors (referred to as machine learning [ML]-RO) yielding the best classification performance over a training (3-fold cross-validation) and testing set. RESULTS: Attributes of the patient data set ( n = 1179) were clustered into 9 groups according to clinical significance. Our analysis produced 6 ML-RO models in the training set, which yielded better likelihood ratios (LRs) than baseline models. Of interest, the most significant LRs were observed in 2 ML-RO approaches not including the Khorana score (ML-RO-2: positive likelihood ratio [+LR] = 1.68, negative likelihood ratio [-LR] = 0.24; ML-RO-3: +LR = 1.64, -LR = 0.37). The enhanced performance of ML-RO approaches over the Khorana score was further confirmed by the analysis of the areas under the Precision-Recall curve (AUCPR), and the approaches were superior in the ML-RO approaches (best performances: ML-RO-2: AUCPR = 0.212; ML-RO-3-K: AUCPR = 0.146) compared with the Khorana score (AUCPR = 0.096). Of interest, the best-fitting model was ML-RO-2, in which blood lipids and body mass index/performance status retained the strongest weights, with a weaker association with tumor site/stage and drugs. CONCLUSIONS: Although the monocentric validation of the presented predictors might represent a limitation, these results demonstrate that a model based on MKL and RO may represent a novel methodological approach to derive VTE risk classifiers. Moreover, this study highlights the advantages of optimizing the relative importance of groups of clinical attributes in the selection of VTE risk predictors

    Ensuring sample quality for biomarker discovery studies - Use of ict tools to trace biosample life-cycle

    Get PDF
    The growing demand of personalized medicine marked the transition from an empirical medicine to a molecular one, aimed at predicting safer and more effective medical treatment for every patient, while minimizing adverse effects. This passage has emphasized the importance of biomarker discovery studies, and has led sample availability to assume a crucial role in biomedical research. Accordingly, a great interest in Biological Bank science has grown concomitantly. In biobanks, biological material and its accompanying data are collected, handled and stored in accordance with standard operating procedures (SOPs) and existing legislation. Sample quality is ensured by adherence to SOPs and sample whole life-cycle can be recorded by innovative tracking systems employing information technology (IT) tools for monitoring storage conditions and characterization of vast amount of data. All the above will ensure proper sample exchangeability among research facilities and will represent the starting point of all future personalized medicine-based clinical trials

    The effect of social media and infodemic on mental health during the COVID-19 pandemic: results from the COMET multicentric trial

    Get PDF
    On January 30, 2020, the World Health Organization (WHO) declared the status of pandemic due to the COVID-19 infection. The initial phases of the pandemic were characterized by uncertainty and public fears. In order to cope with such unexpected conditions, people adopted different coping strategies, including search for information, accessing Internet, and using social media. The present study based on the COMET collaborative research network aims to: (1) assess use of Internet and of social media among the Italian general population; (2) explore differences in web usage between people with pre-existing mental disorders and the general population; (3) identify changes over time in social media usage along the phase 1 of the pandemic; (4) identify the clinical, socio-demographic and contextual predictors of excessive use of social media. A significant increase in time spent on Internet, with an average time of 4.8 ± 0.02 h per day, was found in the global sample of 20,720 participants. Compared with the general population, Internet use was significantly higher in people with pre-existing mental disorders (5.2 ± 0.1 h vs. 4.9 ± 0.02; p < 0.005). According to the multivariate logistic regression model, the risk of excessive use of social media and Internet was significantly higher in people with moderate levels of depressive symptoms (OR: 1.26, CI 95%: 0.99 to 1.59, p < 0.0.005); while protective factors were being students (OR: 0.72, CI 95%: 0.53 to 0.96, p < 0.0029) and living in central Italy (OR: 0.46, CI 95%: 0.23 to 0.90, p < 0.002). The evaluation of social media and Internet use by the general population represents a first step for developing specific protective and supportive interventions for the general population, including practical suggestions on how to safely use Internet and social media

    COVID-19-Related Social Isolation Predispose to Problematic Internet and Online Video Gaming Use in Italy

    Get PDF
    COVID-19 pandemic and its related containment measures have been associated with increased levels of stress, anxiety and depression in the general population. While the use of digital media has been greatly promoted by national governments and international authorities to maintain social contacts and healthy lifestyle behaviors, its increased access may also bear the risk of inappropriate or excessive use of internet-related resources. The present study, part of the COVID Mental hEalth Trial (COMET) study, aims at investigating the possible relationship between social isolation, the use of digital resources and the development of their problematic use. A cross sectional survey was carried out to explore the prevalence of internet addiction, excessive use of social media, problematic video gaming and binge watching, during Italian phase II (May-June 2020) and III (June-September 2020) of the pandemic in 1385 individuals (62.5% female, mean age 32.5 +/- 12.9) mainly living in Central Italy (52.4%). Data were stratified according to phase II/III and three groups of Italian regions (northern, central and southern). Compared to the larger COMET study, most participants exhibited significant higher levels of severe-to-extremely-severe depressive symptoms (46.3% vs. 12.4%; p &lt; 0.01) and extremely severe anxiety symptoms (77.8% vs. 7.5%; p &lt; 0.01). We also observed a rise in problematic internet use and excessive gaming over time. Mediation analyses revealed that COVID-19-related general psychopathology, stress, anxiety, depression and social isolation play a significant role in the emergence of problematic internet use, social media addiction and problematic video gaming. Professional gamers and younger subjects emerged as sub-populations particularly at risk of developing digital addictions. If confirmed in larger and more homogenous samples, our findings may help in shedding light on possible preventive and treatment strategies for digital addictions

    The effect of social media and infodemic on mental health during the COVID-19 pandemic: results from the COMET multicentric trial

    Get PDF
    On January 30, 2020, the World Health Organization (WHO) declared the status of pandemic due to the COVID-19 infection. The initial phases of the pandemic were characterized by uncertainty and public fears. In order to cope with such unexpected conditions, people adopted different coping strategies, including search for information, accessing Internet, and using social media. The present study based on the COMET collaborative research network aims to: (1) assess use of Internet and of social media among the Italian general population; (2) explore differences in web usage between people with pre-existing mental disorders and the general population; (3) identify changes over time in social media usage along the phase 1 of the pandemic; (4) identify the clinical, socio-demographic and contextual predictors of excessive use of social media. A significant increase in time spent on Internet, with an average time of 4.8  ±  0.02 h per day, was found in the global sample of 20,720 participants. Compared with the general population, Internet use was significantly higher in people with pre-existing mental disorders (5.2  ± 0.1 h vs. 4.9  ±  0.02; p &lt; 0.005). According to the multivariate logistic regression model, the risk of excessive use of social media and Internet was significantly higher in people with moderate levels of depressive symptoms (OR: 1.26, CI 95%: 0.99 to 1.59, p &lt; 0.0.005); while protective factors were being students (OR: 0.72, CI 95%: 0.53 to 0.96, p &lt; 0.0029) and living in central Italy (OR: 0.46, CI 95%: 0.23 to 0.90, p &lt; 0.002). The evaluation of social media and Internet use by the general population represents a first step for developing specific protective and supportive interventions for the general population, including practical suggestions on how to safely use Internet and social media
    • …
    corecore